Word Set Probability Boosting : The AB / TAB Algorithms
نویسندگان
چکیده
Based on the observation that the unpredictable nature of conversational speech makes it almost impossible to reliably model sequential word constraints, the notion of word set error criteria is proposed for improved recognition of spontaneous dialogues. The single pass Adaptive Boosting (AB) algorithm enables the language model weights to be tuned using the word set error criteria. In the two pass version of the algorithm, the basic idea is to predict a set of words based on some a priori information, and perform a re-scoring pass wherein the probabilities of the words in the predicted word set are ampliied or boosted in some manner. An adaptive gradient descent procedure for tuning the word boosting factor has been formulated which enables the boost factors to be incrementally adjusted to maximize accuracy of the speech recognition system outputs on held-out training data using the word set error criteria. Two novel models which predict the required word sets have been presented: utterance triggers which capture within-utterance long-distance word inter-dependencies, and dialogue triggers which capture local temporal dialogue-oriented word relations. The proposed Trigger and Adaptive Boosting (TAB) algorithm, and the single pass Adaptive Boosting (AB) algorithm have been experimentally tested on a subset of the TRAINS-93 spontaneous dialogues and the TRAINS-95 semi-spontaneous corpus, and have resulted in improved performances.
منابع مشابه
Improved spontaneous dialogue recognition using dialogue and utterance triggers by adaptive probability boosting
Based on the observation that the unpredictable nature of conversational speech makes it almost impossible to reliably model sequential word constraints, the notion of word set error criteria is proposed for improved recognition of spontaneous dialogues. The basic idea in the TAB algorithm is to predict a set of words based on some a priori information, and perform a re-scoring pass wherein the...
متن کاملBoosting Applied toe Word Sense Disambiguation
In this paper Schapire and Singer’s AdaBoost.MH boosting algorithm is applied to the Word Sense Disambiguation (WSD) problem. Initial experiments on a set of 15 selected polysemous words show that the boosting approach surpasses Naive Bayes and Exemplar–based approaches, which represent state–of–the–art accuracy on supervised WSD. In order to make boosting practical for a real learning domain o...
متن کاملBoosting Applied to Word Sense Disambiguation
In this paper Schapire and Singer s AdaBoost MH boosting algorithm is applied to the Word Sense Disambiguation WSD problem Initial experiments on a set of selected polysemous words show that the boosting approach surpasses Naive Bayes and Exemplar based ap proaches which represent state of the art accuracy on supervised WSD In order to make boosting practical for a real learning domain of thou ...
متن کاملUsing boosting and POS word graph tagging to improve speech recognition
The word graphs produced by a large vocabulary speech recognition system usually contain a path labelled with the correct utterance, but this is not always the highest scoring path. Boosting increases the probability of words which occur often in the word graph, which are in some sense robust. Adding syntactic information allows rescoring of arc probabilities with the possibility that more gram...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996